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Abstract 

Factors that contribute to the transmission of human immunodeficiency virus type 1 (HIV-I), especially drug-resistant HIV-1 
variants remain a significant public health concern. In-depth phylogenetic analyses of viral sequences obtained in the 
screening phase from antiretroviral-nalve HIV-infected patients seeking enrollment in EPZl 08859, a large open-label study 
in the USA, Canada and Puerto Rico (ClinicalTrials.gov NCT00440947) were examined for insights into the roles of drug 
resistance and epidemiological factors that could impact disease dissemination. Viral transmission clusters (VTCs) were 
initially predicted from a phylogenetic analysis of population level HIV-1 pol sequences obtained from 690 antiretroviral- 
naTve subjects in 2007. Subsequently, the predicted VTCs were tested for robustness by ultra deep sequencing (UDS) using 
pyrosequencing technology and further phylogenetic analyses. The demographic characteristics of clustered and non- 
clustered subjects were then compared. From 690 subjects, 69 were assigned to 1 of 30 VTCs, each containing 2 to 5 
subjects. Race composition of VTCs were significantly more likely to be white (72% vs. 60%; p = 0.04). VTCs had fewer reverse 
transcriptase and major PI resistance mutations (9% vs. 24%; p = 0.002) than non-clustered sequences. Both men-who-have- 
sex-with-men {MSM) (68% vs. 48%; p = 0.001) and Canadians (29% vs. 14%; p = 0.03) were significantly more frequent in 
VTCs than non-clustered sequences. Of the 515 subjects who initiated antiretroviral therapy, 33 experienced confirmed 
virologic failure through 144 weeks while only 3/33 were from VTCs. Fewer VTCs subjects (as compared to those with non- 
clustering virus) had HIV-1 with resistance-associated mutations or experienced virologic failure during the course of the 
study. Our analysis shows specific geographical and drug resistance trends that correlate well with transmission clusters 
defined by HIV sequences of similarity. Furthermore, our study demonstrates the utility of molecular and epidemiological 
analysis of VTCs for identifying population-specific risks associated with HIV-1 transmission and developing effective local 
healthcare strategies. 
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Introduction 

With the availability of higUy active antiretroviral therapy 
(HAART) transmission of drug resistant human immunodeficiency 
virus type 1 (HIV-I) quasi-species has become of increasing 
concern. While drug resistance acquired through vertical trans- 
mission has been well studied, considerably less is known about 



epidemiology surrounding the transmission of drug resistance in 
adults. Using phylogenetic analyses of HIV-I pol sequences from 
HIV-infected subjects, highly similar groups of viruses can be 
inferred to be clusters of related infections, or viral transmission 
clusters (VTCs). Previous phylogenetic studies have explored HIV 
transmission networks among subjects within a relatively confined 
geographical area [1-19] with fewer attempts to determine 
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transmission dynamics in a large multi-site or an international 
setting [20-23]. With the advent of the more sensitive next 
generation sequencing (NGS) methodologies, ultra deep sequenc- 
ing (UDS) is being increasingly used to detect low abundance viral 
variants derived from clinical viral isolates [24]. 

The viral sequence interrelationships and epidemiological data 
from antiviral-naive subjects seeking treatment can provide a 
framework for better understanding the population dynamics 
behind drug resistant virus and of HIV-1 transmission networks 
which may span large geographic regions. In this study we used a 
phylogenetic approach to identify VTCs in ART-naive subjects 
infected with highly related HIV. These subjects, from the 
continental United States (USA), Canada and Puerto Rico, were 
seeking HAART treatment through enrollment in the open-label, 
randomized clinical trial ARIES (Atazanavir-Ritonavir Induction 
with Epzicom Study; EPZ 108859) [25-27]. We examined 
demographic differences between HIV-infected subjects using 
Sanger and NGS DNA sequencing technologies along with 
phylogenetic analyses. Responses to therapy for subjects from 
VTCs were compared to those subjects with non-clustered 
sequences to better understand the population dynamics behind 
the spread of HIV infection within study subje[:ts and aid in future 
design of treatment strategies. 

Although our population was restricted to only patients who 
choose to seek treatment through climcal trial enrollment, it is in 
agreement with other studies that found the majority of VTCs 
tend to occur within close geographic proximity and to be 
associated with men-wh()-ha\'e-sex-with-men (MSM) behaviors. 
Our study is consistent with reports from other VTC studies that 
HAART is highly effective in treating this population. 

Materials and Methods 

Ethics Statement 

All study subjects provided written informed consent. This study 
was approved by the ethics review boards used by for the 72 
participating centers, which included Ethica Clinical Research 
Inc., MUHC Biomedical D, Sunnybrook Health (Hth) Sciences 
Centre (CTR) Research (Res) Ediics Board (B), University (U) Hth 
Network Res Ethics B, Copernicus Group IRB, Henry Ford Hth 
System (S) Res Administration, U of South Florida IRB, U and 
Medical (Med) CTR IRB of East Carolina U, U of Toledo 
Department (Dept) for Human (HU) Res Protections(P) Biomed- 
ical IRB, Whitman-Walker CKnic IRB, Summa Hth S Hospitals 
(Hp) IRB, Western IRB, U of Kentucky Office (O) of Res 
Integrity, HU Subjects (SU) Res Committee (MN), Florida Dept of 
Hth IRB, Kaiser Permanente CTR for Hth Res/Southeast, U of 
Southern California Hth Sciences Campus IRB, Georgetown 
University IRB, St. Joseph's Hp IRB, Chesapeake Research 
Review, Inc., U of Texas Med Branch O of Res SU P IRB, Loyola 
U Med CTR IRB for the P of HU SU, Washington U in St. Louis 
HU Res P O, U of Chicago IRB, U of Southern California School 
of Medicine IRB, Partners HU Res O, New York Med College 
Committee for P of HU SU, Thomas Jefferson IRB, Miriam Hp 
Clinical Res RB, U of Pennsylvania IRB, and the AIDS Res 
Consortium of Atlanta, Inc. IRB. This study was conducted in 
accordance with Good CUnical Practice. 

Study Design 

Blood plasma-derived HIV samples used in the genotypic 
analysis were obtained from HIV-infected, ART-naive subjects 
during their screening \isit to determine eligibility for study 
enrollment in ARIES. HIV genotyping was performed prospec- 
tively on all subjects at screening as an appropriate HIV 



population genotype obtained by conventional sequencing was a 
study eligibility criterion. FuU descriptions of the study design for 
ARIES, the subject enrollment criteria and the clinical results from 

this study have been published previously [26,27]. 

A sumary of the clinical trial study primar)' and secondary outcome 
measures and their results are also available at http:/ / cHnicsiltrials.gov/ 
ct2/home (ClinicalTrials.gov Identifier NCT00440947). All pre- 
therapy samples were obtained during 2007, from a total of 72 centers 
in the mainland USA, Puerto Rico, and Canada. Subjects were eligible 
for enrollment if they met all study endy criteria. Primary entry criteria 
included being: 1) HIV-infected; 2) ART-naive; 3) a minimum of 18 
years of age; 4) plasma HTV RNA > 1 ,000 copies/mL with any CD4+ 
lymphocyte count and, 5) an appropriate HIV genotype. Subjects were 
ineligible if they had any of the following conditions: 1) HLA-B'''5701 
allele or hepatitis B surface antigen positive; 2) medical conditions that 
could compromise their safety or interfere with drug absorption; 3) 
required use of prohibited medications or; 4) protocol-specified 
abnormal laboratory values or a creatinine clearance estimated by 
the Cockcroft-Gault equation of <50 mL/min. AU enrolled subjects 
initially received ritonaw-boosted atazanaw plus abacavir/lamivu- 
dine. Subjects with HIV-RNA suppression levels below 50 copies/mL 
by week 30 were randomized 1:1 at week 36 to either continue their 
original regimen or discontinue ritonavir for up to 1 08 additional weeks 
of treatment. 

DNA Sequence Analysis 

Blood plasma-derived pre-therapy HIV reverse transcriptase 
(RT) and protease regions of the pol gene were analyzed 
prospectively by population genotyping at the screening visit 
using TRUGENE (Research Think Tank; GA, USA). The HIV 
sequence analyzed by TRUGENE region included protease 
amino acids (AA) 4—99 and reverse transcriptase (RT) AA 38- 
247. Samples from subjects with confirmed virologic failure or 
cVF (defined as failure to suppress HIV-1 RNA to <400copies/ 
mL by Week 36 or confirmed virologic rebound to >400copies/ 
mL after suppression to <400copies/mL at any time) during 
ARIES were analyzed by Phenosense GT at baseline and time of 
cVF (Monogram Biosciences South San Francisco, California, 
USA). Nucleoside reverse transcriptase inhibitors (NRTIs), non- 
nucleoside reverse transcriptase inhibitors (NNRTIs) and protease 
inhibitor (PI) resistance-associated mutations were defined accord- 
ing to International Antiviral Society USA (lAS-USA) Guidelines 
from March 2013 (http://www.iasusa.org/sites/default/files/ 
tam/21-l-6.pdf). 

HIV RT (reverse transcriptase) and protease for the pre-therapy 
time-point was also analyzed by UDS as performed by 454 Life 
Sciences/Roche (Branford, CT, USA) using proprietary primers. 
The HIV sequences analyzed by UDS included protease AA 10- 
99 and RT AA 1-251. Median UDS sample read depth over all 
amino acid positions was 3273 reads (VTC subjects median 
sample read depth/amino acid position range: 101-15529) and 
frequency results were censored at with a read depth SIOO. 

Quest Diagnostics (Van Nuys, California, USA) performed all 
other laboratory tests. HIV RNA concentrations were measured 
by the Roche Cobas Amphcor HFV-l Monitor or Ultrasensitive 
Monitor Test (Roche Diagnostic Systems, Branchburg, NJ). Clade 
analysis was performed using RIP 3.0 (http://www.hiv.lanl.gov/ 
content/sequence/RIP/RIP.html). RIP was configured to per- 
form query alignments against the HIV database standard HFV-l 
subtype consensus alignments. Each pol sequence was analyzed 
with a sUding window of decreasing sizes 200,150, and 100 
nucleotides (nt) until a subtype could be determined or was marked 
as unknown. 
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Phylogenetic Reconstruction and VTC Prediction 

HIV-l RT and protease nucleotide sequences obtained by 
population genotype sequencing from ART-naive subjects at the 
screening visit were concatenated and subsequendy aligned using 
the software MUSCLE [28]. The resulting alignment was used to 
reconstruct a neighbor joining (NJ) tree using the programs 
DNADIST (default F84 matrix) and NEIGHBOR from the 
PHYLIP 3.67 package [29]. In estimating genetic distances, the 
F84 model incorporates different rates of transition and transver- 
sion and also allows for different frequencies of the four 
nucleotides. HIV-1 Subtype K sequence AJ24935 was used as 
an outgroup for the tree and 1000 bootstrap replicates were used 
to estimate node support. A VTC was defined as any operating 
taxonomic unit (OTU) cluster having 2 or more sequences with a 
median confidence estimate of &95% of 1000 bootstrap replicates 
using a custom Perl script. The choice of classifying OTUs based 
on the median rather than an absolute value threshold allowed 
some leniency for comparing VTC predictions across datasets. 

HIV samples from ARIES participants identified by population 
sequencing as being in VTCs were sent for further HIV RT and 
protease UDS analysis along with pre-therapy samples from all 
ARIES cVF participants and a random subset selected from all 
ARIES participants with remaining pre-therapy HIV samples. 
Sequence concatenation, alignment and phylogenetic analyses 
were as described above for all samples. A single consensus 
sequence was generated for each subject in this dataset using the 
majority nucleotide sequence at each codon. Phylogenetic trees 
were visualized and drawn using Archaeopteryx v0.98I3 tree 
viewer [30]. 

We combined the two resultant phylogenetic analyses to further 
filter our final selection of VTCs by stipulating that any VTC must 
have at least two shared participants predicted by both population 
and UDS datasets. The minimum shared participants were 
considered to be high confidence VTC predictions and were 
analyzed further for demographic trends. 

Descriptive statistics were primarily used in presenting the data, 
although Mann-Whitney U test (2-tailed) was used for comparing 
population medians. Data comparisons between subjects with 
clustered versus those with non-clustered virus were carried out 
using the Fisher's exact test (2-tailed). 

Results 

HIV RT and protease population genotyping was prospectively 
attempted using plasma samples collected from 726 subjects who 
were initially screened for the study. HIV population genotype was 
obtained from a total of 690 unique ART-naive subjects from sites 
located in the continental United States, Puerto Rico and Canada 
and an appropriate genotype was a requirement for later study 
enrollment. These 690 subjects were from 72 sites; each site 
recruited a median of 8 subjects (range 1-52 subjects). For the 36 
samples for which there were no initial population genotype data, 
typically no population genotype was obtained post analysis 
although for a small subset of subjects two genotypes were 
obtained. The most recent genotype was used if a subject had been 
allowed to rescreen for the study after failing to meet other entry 
criteria and was assigned a new subject identification. UDS 
analysis was performed after the initial phylogenetic analysis of the 
population sequence on a subset of 350 samples, which included 
baseline or screen samples from all subjects identified as being in a 
VTC by population genotype phylogenetic analysis, all subjects 
with cVF and a randomly selected subset of 250 study subjects. 
UDS results were obtained for 325 baseline or screen samples. No 
results were obtained for some samples due to technical limitations 



of the assay (typically insufficient plasma or HIV-1 RNA <2000 
copies/mL). 

Most subjects (664/690; 96%) were infected with subtype B 
virus, with further subjects detected with viral subtypes A (n = 6), C 
(n= 11), D (n= 1), G (n= 7) or indeterminate (n= 1). 

Comparisons of VTCs Predictions from Two Datasets 

A total of 40 VTCs, comprising virus from 91 subjects, were 
predicted by NJ tree reconstruction using the population 
sequencing dataset (Table SI). In comparison, NJ analysis of the 
UDS dataset, yielded 31 VTC predictions representing viruses 
from 71 subjects. A total of 15 viral genotypes from subjects who 
were part of VTCs in the NJ population sequence tree were not 
identified by UDS analysis as being in these same clusters. 
Conversely, a total of 15 genotypes from subjects who were part of 
VTCs in the UDS phylogenetic analysis were not similarly 
clustered by population sequencing. 

The majority of the predicted VTCs by either method 
overlapped; however there were 10 and 1 VTCs predicted by 
the population and UDS analyses, respectively, which were not 
identified by the other method. In total, 30 VTCs (composed of 69 
subjects) were predicted by both methods and considered here as 
high confidence VTC predictions or VTCs (Figxire 1). Composite 
NJ trees using both datasets are shown in Figure 2 and Figure S 1 . 

Comparison between the Non-clustered Subject and VTC 
Subject Populations 

The epidemiological and pre-treatment clinical characteristics 
for the overall cohort population, VTCs and non-clustered 
subjects are given in Table 1. The subjects within VTCs were 
demographically similar in many respects to the non-clustered 
population. However, they were significantly more likely to have 
reported MSM as a transmission risk (p = 0.001) and be from 
Canadian study centers (p = 0.003). VTC were significantiy more 
likely to be white (72% vs. 60%; p = 0.04) and non-VTCs were 
significantiy more likely to be black (22% vs. 34%; p - 0.04). There 
were no VTCs where the subjects did not list any risk factor for 
acquiring HIV. 




NGS 

Dataset 



Figure 1. Prediction of VTCs by two studies using HIV-1 
sequences obtained from pre-tlierapy subjects. Shown are 
comparisons of VTC numbers as predicted by NJ tree-reconstruction 
of either population-based, ultra-deep sequencing (UDS) or both 
datasets. See Methods and IVlaterials for VTC prediction methodology. 
doi:1 0.1 371/journal.pone.008961 1 .gOOl 
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Figure 2. Neighbor-joining (NJ) tree based on population-genotyped HIV sequences from 690 pre-tKierapy subjects. High-confidence 
VTCs are labeled In red (1-30). These VTCs all represent 2 or more clustered subjects with a minimum confidence level of 95% In the (1000 bootstrap 
NJ replicates) for both population and UDS datasets. VTCs predicted only by the population or UDS data are shown In green and blue, respectively. 
VTCs where sequences are subsets of population or UDP VTCs have multiple colored labels. A more detailed phylogenetic tree with the OTUs labeled 
appears In Figure SI. 
dol:1 0.1 371/journal.pone.008961 1 .g002 



Composition of Subjects in High Confidence VTCs 

Subject compositions within each of the 30 high confidence 
VTCs are given in Table 2. Almost all of the VTCs (97%) 
included men while only 5 female participants were found across 4 
VTCs. Of these 4 VTCs, one contained only women (one African- 
descent USA subject and one African-descent Canadian subject) 
and 2 contained one female and one male (all white USA 
participants), while the remaining female-containing VTC was 
composed of 4 participants (one African-descent female and 3 
white males; all from different USA cities). Six VTCs were 
multiracial in composition, while 18 VTCs included only white 
participants and 6 VTCs were exclusively of African descent. 

Women within the VTCs reported only heterosexual HIV 
infection risks. Risk factors for the 64 men in VTCs included 
MSM (70%) and heterosexual contact (13%) or both (3%) MSM 
and heterosexual contact risk factors. Four VTCs were composed 
of participants with only heterosexual contact listed as an HIV 
transmission risk while 5 VTCs, including one of the largest-sized 
5 member clusters, had participants who noted either heterosexual 



and/or MSM contact as transmission risks. The majority of VTCs 
(80%) contained at least one subject with MSM risk factor. 

The majority of VTCs were composed of subjects all receiving 
antiretroviral therapy (ART) treatment in the same city (57%) or 
state /province (63%). However, 3 VTCs contained subjects from 
both Canada and the USA (including Puerto Rico). The majority 
of the VTCs (29/30) identified by both DNA sequencing analyses 
were infected with subtype B HIV-1 while the remaining VTC 
was comprised of 3 white males and one African Heritage female; 
all from the USA who were infected with subtype A HIV- 1 . 

The majority (87%) of the 30 high confidence VTCs were 
comprised of 2 subjects while 4/30 VTCs were composed of >2 
subjects including 2 VTCs with 5 subjects, and VTCs with 3 and 4 
subjects. The comparison of these VTCs containing >2 subjects is 
summarized in Table 3. 

All subjects in these larger VTCs were males, except for a single 
female in a 4 member VTC. Most males reported MSM or no risk 
factors while the female subject reported only heterosexual risk 
factors. In 3 of these larger-sized VTCs at least 2 of the members 
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Table 1. Comparison of population demographics across study cohort and viral transmission clusters (VTC). 







Totsl cohort 




Non-clust6r6d Subjects 




ARIES Cohort 


(n = e90) 


VTC Subjects (n = 69) 


(n = 621) 


p-value 


Median Age (years) 


38 


36 


38 


0.29' 


Male Gender 


582 (84%) 


64 (93%) 


518 (83%) 


0.05'' 


Median HIV-RNA (log,oCopies/mL) 


5.05 


5.05 


5.05 


0.90' 


HIV-RNA SI 00,000 copies/mL 


322 


30 


292 


0.61" 


Median CD4 (cells/mm^) 


203.5 


215 


203 


0.29' 


CDC Classification 


n = 516 


n = 60 


n = 456 




CDC Class A 


352 (68%) 


45 (75%) 


307 (67%) 


0.30" 


CDC Class B 


98 (19%) 


8 (13%) 


90 (20%) 


0.29" 


CDC Class C 


66 (13%) 


7 (12%) 


59 (13%) 


i.oo" 


Race 


n = 689 


n=69 


n = 620 




White 


419 (61%) 


50 (72%) 


369 (60%) 


0.04" 


Black/ African descent 


226 (33%) 


15 (22%) 


211 (34%) 


0.04" 


Asian 


14 (2%) 


2 (3%) 


12 (2%) 


0.64" 


Other 


44 (6%) 


4 (6%) 


40 (6%) 


i.oo" 


Risk Factors 


MSM 


344 (50%) 


47 (68%) 


297 (48%) 


0.001 " 


Heterosexual 


1 56 (23%) 


15 (22%) 


141 (23%) 


I.oo" 


IV Drug Use 


23 (3%) 


0 (0%) 


23 (4%) 


0.15" 


Transfusion 


8 (1%) 


0 (0%) 


8 (1%) 


i.o" 


Occupational 


3 (0%) 


0 (0%) 


3 (0%) 


i.o" 


Other 


1 7 (2%) 


0 (0%) 


1 7 (3%) 


0.4" 


Hepatitis B 


13 (2%) 


0 (0%) 


13 (2%) 


0.63" 


Hepatitis C+ (n = 689, 69, 620) 


54 (8%) 


2 (3%) 


52 (8%) 


0.15" 


Geographic location 


Mainland USA Subjects 


542 (79%) 


46 (67%) 


496 (80%) 


0.02" 


Puerto Rican Subjects 


41 (6%) 


3 (4%) 


38 (6%) 


0.79" 


Canadian Subjects 


107 (16%) 


20 (29%) 


87 (14%) 


0.003" 



'IWann-Whitney U test; 
"Fisher's exact test. 

The specific "n" Is noted when demographic data was obtained for a smaller number of subjects. Significant findings (p<0.05) are highlighted in bold. 
doi:1 0.1 371 /journal.pone.008961 1 .tOOl 



were from the same Canadian city and province; one of these 
VTCs also included 2 subjects from the same USA city and state. 
The remaining VTC had a more complex geographic distribution 
with all members in different locations across the USA. AH subjects 
in VTCs with more than 2 members were Hepatitis B/Hepatitis C 
negative and had similar ranges of CD4 and viral loads. 

Comparisons of HIV-1 Population Genotype Resistance 
Mutations 

By population genotyping, lAS-USA defined RT (including 
minor NNRTI mutations) and major PI HIV resistance-associated 
mutations were detected in virus from 151/621 (24%) of the non- 
clustered subjects, and in significandy fewer (p = 0.002) VTC 
subjects (6/69; 9%) as summarized in Table 4. Minor PI 
mutations were excluded from analysis since 96% of subjects 
had virus with one or more minor PI mutations. When minor 
NNRTI and NRTI mutations were excluded, major NRTI, 
NNRTI and PI mutations were detected in virus from 1 10 (18%) 
of the non-clustered subjects and from 6 (9%) of VTC subjects. 

NNRTI mutations were the most commonly detected resis- 
tance-associated mutations and the differences between VTC and 



non-clustered groups was significant (p = 0.003). Four (6%) VTC 
subjects had virus with NNRTI mutations that included the 
variants K103N, E138A, Y181C and H221Y. In contrast, 
significandy more non-clustered subjects had virus with NNRTI 
mutations (122/621 or 20% of viral samples). The mutations 
observed included all of the previously listed variants as well as 
V90I, A98G, KIOIE, KIOIP, K103S, V106I, V106M, V108I, 
E138G, E138K, V179D, Y188L, G190A, P225H and M230I. 

No virus from VTC subjects had NRTI mutations, compared 
with 29/621 (5%) of the non-clustered population. The NRTI 
viral mutations observed in non-clustered subjects included M41L, 
D67N, K70R, V75I, Q.151M, M184I/V, L210W, T215Y, and 
K219E/Q, There were two major PI mutation (M46I, Q53E) 
detected in two (3%) viruses from VTC subjects as compared with 
the subjects with non-clustered virus where 22/621 (4%) contained 
major PI mutations. In this latter group, most of the subjects (18/ 
22) had virus with a single major PI mutation in addition to minor 
variants. Within VTCs where major mutations were detected, no 
other subjects within the VTC had major mutations. However, 
minor PI mutation profiles of viruses within a VTC were relatively 
consistent (Table SI). 
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Table 2. Viral transmission clusters or VTC attributes and demographics. 





Cluster Attributes 


# of VTCs 


Tot3l ^ of subjects within these 
clusters 


>2 subjects 


4 


17 


Includes males 


29 


67 


Includes females 


4 


10 


Female only 


1 


2 


White (White/Caucasian/European Heritage) only 


18 


40 


Black (African Heritage} only 


6 


12 


2:2 races within cluster 


6 


17 


21 Hispanic subject 


8 


19 


21 subject MSM Risk Factor 


24 


57 


Includes both heterosexual and MSM Risk Factor 


5 


15 


21 subject with IVDU 


0 


0 


Heterosexual only Risk Factor 


4 


8 


All subjects in the VTC treated in the same state/province 


19 


38 


All subjects in the VTC treated in the same city 


17 


34 


22 subjects treated in the same city 


20 


47 


22 subjects treated in the same state/province 


22 


51 


Clusters that included subjects from mainland USA and Puerto 
Rico or subjects from mainland USA and Canada 


3 


9 


VTC that include 21 Canadian subjects^ 


8 


23' 


VTC that include 21 Puerto Rican subjects^ 


2 


4" 


HCV co-infected 


2 


7 


HBV co-infected 


0 


0 


21 subject CDC Class C 


7 


19 


21 subject HIV-RNA >100,000 copies/mL 


24 


57 


21 subject CD4>350 cells/mm' 


11 


27 


21 subject CD4<250 cells/mm^ 


19 


47 



'Not all subjects within each cluster were Canadian. 
"^Not all subjects within each cluster were Puerto Rican. 
doi:l 0.1 371 /journal.pone.008961 1 .t002 



Resistance Mutations Detected by UDS Analysis 

The HIV RT and protease UDS genotypic results obtained for 
the 69 VTC subjects were analyzed for the presence of lAS-USA- 
defined NNRTI, NRTI, and major PI resistance mutations 
(Table 4). UDS analysis detected NRTI, NNRTI or major PI 
resistance mutations in virus from 20/69 (29%) of VTC subjects 



(from 13 VTCs) compared to virus from 6/69 (9%) of VTC 
subjects detected by population sequencing (from 6 VTCs). Most 
mutations detected by UDS were at frequencies of <6%. When 
UDS results were filtered to include only mutations detected at a 
frequency of &6%, then 5 of the 6 VTC subjects identified by 
population genotyping as having virus with lAS-USA-defined 



Table 3. Comparison of viral transmission clusters (VTCs) with more than two subjects. 



VTC CD4 cells/mm^ Viral Load logm 

(n) Ethnicity(Count)^ Geographic Location(n)'' Risk Factors (n) (Min-Max) copies/mL (Mln-Max) 



3 


White (3) 


Canadian City/Province A (2), 
Canadian city/province B (1) 


MSM(3) 


132-318 


3.0-5.7 


4 


African-Heritage (Female), 
White (3) 


All Different USA Cities/States 


Hetero (Female), 
MSM (1), None (2) 


19-507 


3.3-6.1 


5 


White (5) 


Canada City/Province (3), 
US city/state X (2) 


MSM(5) 


153-282 


3.6-5.5 


5 


White (3), African-Heritage (1), 
Asian-Heritage (1) 


Canadian City/Province A (2), 
Canadian city/province B (3) 


MSM (3), Hetero (1), 
None (1) 


19-401 


4.3-5.8 



'All males unless otherwise annotated. 

'^Anonymized locations: A,B,X,. 

doi:1 0.1 371/journal.pone.008961 1 .t003 



PLOS ONE I www.plosone.org 



6 



February 2014 | Volume 9 | Issue 2 | e89611 



ART-Naive HIV-1 Phylogenetic Clustering Patterns 



Table 4. Summary of HIV resistance mutations obtained from ultra-deep sequencing (UDS) study. 





ARIES Cohort 


Total Subjects 
(n = e90) 


Subjects in VTCs 
{n = 69) 


Non VTC Subjects 
(n = e21) 


p-value 


HIV Population Genotype Resistance Mutations 


Any NRTI, NNRTI or major PI mutation 


157 (23%) 


6 (9%)"* 


151 (24%)"* 


0.002 


Major NRTI, NNRTI, PI mutation 


116 (17%) 


6 (9%)^* 


110 (18%)^"* 


0.06 


Any NRTI mutation 


29 (4%) 


0 (0%) 


29 (5%)^'* 


0.10 


Any NNRTI mutation 


126 (18%) 


4 (6%)"* 


1 22 (20%)"'* 


0.003 


Any major NNRTI mutation 


84 (12%) 


4 (6%)=* 


80 (13%)^" 


0.12 


Any major PI mutation 


24 (3%) 


2 (3%)"^* 


22 (4%)"*" 


1.00 


UDS HIV Genotype Resistance IVIutations (n = e9) 


Any NRTI, NNRTI or major PI mutation 


NA 


20 (29%)"= 


NA 


NA 


Major NRTI, NNRTI, PI mutation 


NA 


1 5 (22%)^"= 


NA 


NA 


Any NRTI mutation 


NA 


0 (0%) 


NA 


NA 


Any NNRTI mutation 


NA 


13 (19%)"'= 


NA 


NA 


Any major NNRTI mutation 


NA 


8 (12%)="= 


NA 


NA 


Any major PI mutation 


NA 


7 (10%)'*'= 


NA 


NA 



"Analyzed for HIV from VTC subjects only. 

"*RT: K103N, E138A, Y181C, H221Y; PR: M46I, Q58E. 

'"RT: M41L, D67N, K70R, V75I, V90I, A98G, KIOIE, KIOIP, K103N, K103S, V106I, V106M, V108I, E138A, E138G, E138K, Q151M, V179D, Y181C, M184I, M184V, Y188L, 
G190A, L210W, T215Y, K219E, K219Q, H221Y, P225H, M230I; PR: V32I, M46I, M46L, I54M, Q58E, V82A, N83D, 184V, L90M. 
"^RT: KlOlE, K103N, V106!, E138A, E138G, E138K, Y181C; PR: D30N, M46I, M46L, Q58E, V82A. 
"RT: K103N, E138A, Y181C, H221Y; PR: M46I, Q58E. 

^^''RT: M41L, D67N, K70R, V75!, KlOlE, KIOIP, K103N, K103S, V106M, V108I, E138A, E138K, E138G, Q151M, Y181C, M184!, M184V, Y188L, G190A, L210W, T215Y, K219E, 
K219Q, H221Y, P225H, M230I; PR: V32I, M46!, M46L, I54M, Q58E, V82A, N83D, 184V, L90M. 
^^'^RT: KlOlE, K103N, E138A, E138G, E138K, Y181C; PR: D30N, M46I, M46L, Q58E, V82A. 
'"RT: M41L, D67N, K70R, V75I, Q151M, M184I, M184V, L210W, T215Y, K219E, K219Q; PR: 
""RT: K103N, E138A, Y181C, H221Y; PR: 

""RT: V90I, A98G, KlOlE, KIOIP, K103N, K103S, V106!, V106M, V108I, E138A, E138K, E138G, V179D, Y181C, Y188L, G190A, H221Y, P225H, M230I; PR: ~. 
"'^RT: KlOlE, K103N, VI 061, E138A, E138K, E138G, Y181C; PR: -. 
="RT: K103N, E138A, Y181C, H221Y; PR: 

^''RT: KlOlE, KIOIP, K103N, K103S, V106M, V108I, E138A, E138G, E138K, Y181C, Y188L, G190A, H221Y, P225H, M230I; PR: 

^'^RT: KlOlE, K103N, E138A, E138G, E138K, Y181C; PR: 

'^"RT: K103N, E138A, Y181C, H221Y; PR: M46I, Q58E. 

'^''RT: PR: V32I, M46!, M46L, I54M, Q58E, V82A, N83D, 184V, L90M. 

'^'^RT: PR: D30N, M46I, M46L, Q58E, V82A. 

Fisher's Exact test was used to determine if there was a significant difference in the occurrence of resistance mutations between clustered and non-clustered subjects. 
Significant findings (p<0.05) are highlighted in bold. 
doi:l 0.1 371 /journal.pone.008961 1 .t004 



NNRTI, NRTI, and major PI resistance mutations were similarly 
identified using UDS, suggesting that the difference between UDS 
and PG was primarily due to newly detected mutations with <6% 
frequency. The remaining (6*) virus was identified by population 
sequencing as containing the major mutation H221Y but this 
mutation was not observed by UDS at a frequency above the 1 % 
cutoff. 

There were 18 additional NNRTI, NRTI, and major PI 
resistance mutation observed with UDS that were detected with a 
prevalence of between ai and <6% of which 12/18 (67%) were 
different types of NNRTI mutations (major: KlOlE, K103N, 
E138A, E138G, E138K; minor: KlOlE, VI 061, E138A, E138G, 
E138K). The remaining 6/18 types of mutations detected at low 
prevalence were major PI mutations (D30N, M46I, M46L, V82A). 
The newly detected mutations that were observed in > 1 viral 
genotype were the major PI mutations M46I (2 subjects) and 
V82A (2 subjects), and the RT mutation V106I (7 subjects). 

A total of 16 subjects had virus with NNRTI and major PI 
mutations detected by UDS that were not detected by population 
sequencing with the frequencies of <6%. For the major PI 
mutations, these were detected by UDS in virus from 6 subjects. 
These PI major mutations included D30N, M46I, M46L and 



V82A with the D30N and M46I mutations occurring in 2 separate 
samples from the same VTC. For 10 viral samples from 8 VTCs, 
NNRTI mutations were detected only by UDS and at frequencies 
<6% and included KlOlE, K103N, V106I, E138A, E138G, and 
E138K. 

Results on Therapy 

Of the 690 screened subjects, a total of 515 subjects met aU 
enrollment criteria and initiated antiviral therapy within the 
ARIES clinical study. Most (60/69; 87%) of the screened subjects 
in the VTCs were eligible and elected to enroll in the study. 
Treatment during the first 36 weeks included two nucleoside 
reverse transcriptase inhibitors (NRTIs; abacavir/lamivudine) and 
a ritonavir-boosted PI (atazanavir). Virologically suppressed 
subjects were randomized at Week 36 to discontinue the ritonavir 
from their ART regimen or to continue the original treatment 
regimen. Of the 515 enrolled subjects, 33 experienced cVF 
(defined as failure to suppress HIV-RNA to <400 copies/mL by 
Week 36 or confirmed viral rebound at any time through 144 
weeks on therapy from <400 copies/mL to &400 copies/mL). 
The majority (30/33) of cVF subjects had non-clustered viral 
sequences. Fourteen of these subjects met cVF by week 36, eight 
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subjects post week 36 through Week 84, and eight additional 
subjects post week 84 through week 144. 

In contrast, the 3 VTC subjects all met cVF relatively late in the 
study, after week 84, and were in different VTCs, including two in 
two-member VTCs and one from the five member VTC. The 3 
subjects with cVF were male, as were aU of the subjects within 
their VTCs (Table 5). All three subjects with cVF were negative 
for Hepatitis B and C. Within their respective VTCs, each had the 
highest viral load (VL) and the lowest CD4 in two of the VTCs. 
Two of these subjects with cVF reported MSM as a risk factor and 
the remaining subject had heterosexual risk. At the start of the 
study, by population sequencing, only one of these subjects who 
later met cVF had virus with a major NNRTI resistance mutation 
(Y181C) but all three had virus with minor PI mutations. UDS 
analysis confirmed all cVF population sequencing mutations and 
also detected minor PI resistance-associated mutations in virus at 
low prevalence (<6%) in one subject. Within these 3 VTCs, one 
subject from a 2 member cluster failed to meet all study eligibility 
requirements and was not enrolled, but all of the non-cVF subjects 
who were in the remaining 2 VTCs initiated drug treatment and 
maintained sustained viral suppressed with HIV-RNA <50 logio 
copies/mL at Week 144. 

Discussion 

Phylogenetic analysis of HIV using population sequencing and 
UDS data identified 40 and 31 VTCs, respectively, from 690 
antiviral-naive HIV-infected subjects residing in Puerto Rico, 
Canada and mainland USA. A total of 30 VTCs comprising 69 
subjects were commonly identified by both DNA sequencing 
methods. Discordances in the number and content of VTCs are 
likely due to minor differences in the HIV regions sequenced by 
two approaches as well the number of subjects. Furthermore, 
population sequencing data could contain nucleotide mixtures 
while the consensus UDS sequence is created from the majority 
nucleotide codon sequence. 

The majority of subjects in both the non-clustered population 
and VTCs were male and white. There were significantly more 
subjects within the VTCs who reported MSM risk factors or from 
sites in Canada. While many VTC subjects were receiving 
treatment in the same city or state/province, 37% of the VTCs 
included subjects who resided in different geographical locations, 
including cluster composed of subjects from both the mainland 



USA and Puerto Rico or from both USA and Canada. Our study 
does not have an extensive longitudinal pre-therapy collection 
period (all samples were collected in 2007) so the finding that 10% 
(69/690) of the screening population were part of VTCs and that 
these networks sometimes encompassed wide geographic areas was 
unexpected. Recently published findings from a large Center for 
Aids Research (CFAR) network phylogenetic analysis at five U.S. 
sites which included both ART-naive and ART-experienced 
patients had a higher proportion (24%) of VTCs, with the majority 
of VTCs (89%) confined to a single site, with <11% of VTCs 
encompassing two sites and only one VTC associated across three 
sites [22]. Clustering was associated with the lack of ART use (p< 
0.001) as well as being marginally associated with MSM/IDU risk 
behaviors (p = 0.06). Viral phylogenetic analysis of newly diag- 
nosed HIV-infected patients from 25 European countries and 
Israel European participating in the SPREAD project found that 
31.2% were part of a VTC, with the majority of these (74.2%) 
from within a single European country, while 15.6% clustered 
with individuals from countries without a common border [23]. 
Clustering was also significandy associated with MSM behavior 
(p<0.0001) and subtype B viral infection (p<0.0001). 

A possible consequence of rapid HIV transmission through 
VTCs could have been a local increase in the transmission of drug 
resistant strains, as sexual transmission of HIV has been reported 
to select for highly fit drug-resistant and persistent variants [31]. In 
the European/Israeli SPREAD surveillance project, transmitted 
drug resistance was significandy more prevalent (p = 0.03) in VTC 
than non-clustered subjects. However, this was not observed in our 
study or in the USA Centers for AIDS Research (CFAR) 
surveillance study [22] where patients in VTCs were less likely 
to have resistance mutations (p<0.001) than patients with non- 
clustered sequences. There is a much higher proportion of 
Subtype B virus in the latter two studies (>96%) than in the 
SPREAD study (66%). The SPREAD study also found numerous 
differences between patients infected with subtype B virus 
compared to non-subtype B with subtype B subjects more often 
MSM and recently infected (both P<0.001) [23]. Drug resistance 
was detected in virus from the non-clustered population at a 
higher prevalence (24%) than in the VTCs (6%). In particular, 
major PI mutations were rarely observed and no NRTI mutations 
occurred within the VTCs which is consistent with data from other 
transmission networks [4,14]. One possible explanation could be 



Table 5. Dennographics for confirmed virologic failure (cVF) containing VTCs. 



VTC 

(n) 


Race/ 

Ethnicity 

(Count) 


Geographic 
Location (n) 


Risk Factors 
(n) 


CD4 cells/ 
(Min-Max) 


Viral Load 
logic 

copies/mL 
(Min-Max) 


PG Resistance 
mutations 


Additional UDS 

Resistance 

mutations 


2 


White(l), Mixed 

Race/Hispanic 

(cVF) 


Same USA state 
but different cities 


Heterosexual 
(cVF), None (1) 


185 (cVF) - 
731 


4.6-4.8 (cVF) 


PI minor: V77I, I93L (cVF); 
PI minor: V77I, I93L 


None 


2 


White(2) 


Same USA city 


MSM (cVF), 
MSM/ 

Hetrosexual(l) 


49 (cVF) - 
234 


5.0-5.9 (cVF) 


Major NNRTI: Y181C, PI Minor: 
L10l,l62V,V77l,l93t (cVF); PI 
Minor: L10l,l62V,L63P,V77l,l93t 


None (cVF); PI minor: 
M36l(6%), I64L(10%), 
PI Major: V82l(l%) 


5" 


White(3+cVF), 

White/Hispanic 

(1) 


Canada City/Province 
B (3), 

USA city/state (l+cVF) 


MSM{5) 


1 53-282; 
215(cVF) 


3.6 - 5.5(cVF) 


PI Minor": I62V,L63P,A71T,I93L; 
PI Minor: M36I,I62V,I93L (cVF); 
PI Minor: L63P,V77I,I93L; PI 
Minor: L63P,V77I,I93L; V77I,I93L 





^Non-cVF subject Hepatitis C positive. 

''PI Minor: G16E(2%), M36l(2%), !64L(1%); PI Minor: L10l(2%), L63P(3%), V82!(l%) (cVF); NNRTI Minor: V106!(4%); NNRTI Minor: V106!(l%), PI Minor: A71T(1%), A71V(1%), 
82(13%); PI Major: M46l(2%), PI Minon L10l(l%), L63P(1%). 
doi:l 0.1 371 /journal.pone.008961 1 .t005 
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the reduced fitness of virus bearing major PI mutations as 
compared with viruses bearing NNRTI mutations which can 
persist in vivo for years in the absence of direct drug pressure 
[32,33]. 

The relatively lower incidence of transmitted drug resistance 
within the VTC population as compared to the incidence of 
transmitted drug resistance detected in the non-clustered popula- 
tion could be attributed to the spread of the virus from source 
partners who are unaware of their infection status. This would be 
consistent with other reports of onward transmission in the MSM 
population in Brighton, UK and Montreal, Canada which have 
suggested that most of the new infections in these communities 
appear to arise from individuals whose infection was undiagnosed 
at the time of transmission or those recently infected with a higher 
viral load [32,33]. In our study, HIV-1 infected Canadian subjects 
were significantly more likely to be part of a VTC than HIV- 
infected subjects from either the continental U.S. or Puerto Rico. 
This could be due to better intervention strategies on the part of 
Canadian healthcare authorities to encourage HIV testing for at 
risk individuals as well as increased emphasis on early treatment 
for HIV-infected subjects. A recent analysis of primary* or early 
infection in HIV-infected MSM subjects from Montreal observed 
an ongoing increase in the detection of larger clusters containing 
more than 5 members which was attributed to a disproportionate 
contribution of subjects in the primary or early stage infection in 
outward transmission [2]. This would suggest that intervention 
strategies may not have been uniformly successful. Data from an 
earlier study on HIV transmission in MSM living in Montreal 
indicated that 57.8% of study subjects who engaged in non-couple 
sex (n = 965) were unaware of their partner's HIV status at last 
sexual encounter. For MSM subjects who engaged in single night 
only encounters (n = 465) that percentage increased to 79.1% [34] . 
Another earlier study in HIV-1 serodiscordant couples in Africa 
demonstrated that transmission rates in persons diagnosed with 
AIDS are increased due to high plasma HIV-1 RNA during this 
period of time, which underscores the importance of early 
detection and treatment in persons infected with HIV [35]. 

The VTC demographic and genot)ping results from our study 
are in line with models which suggest that HIV can be spread by 
HIV-infected persons who may be unaware of their infection 
status. Limitations of our analysis are the lack of behavioral or 
duration of infection data which could have provided additional 
insights. Additionally, there could be potential subject selection 
bias since our study was restricted to patients who had elected to 
seek HIV treatment through clinical trial enrollment. 

Phylogenetic analyses of population sequence and UDS data 
produced similar results for VTC predictions. The respective VTC 
cut-off values of 95°/i) bootstrap NJ tree replicates is highly 
conservative. Relaxing those constraints could broaden the size 
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